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Abstract 

As the most fundamental empirical law, Zipf's law has been studied 
from many aspects. But its meaning is still an open problem. Some 
models have been constructed to explain Zipf's law. In the letter, a new 
concept named nonsymmetric entropy was introduced, maximizing non- 
symmetric entropy leads naturally to Zipf's law. 
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Zipf's law ( 1 Jwhich states that the frequency of a word decays as a power 
law of its rank, is regarded as a basic hypothesis with no need for explanation in 
recent models of the evolution of syntactic coinmunication( 2 ). As an empirical 
law, Zipf's law is the most fundamental fact in quantitative linguistics and other 
complex systems, its meaning is still an open problem which has been tried to 
explain from several aspects of its origins { E El El EI- h is necessary to find 
a suitable mechanism for Zipf's law. In the present article, we introduce a new 
concept named nonsymmetric entropy so that Zipf's law is derived naturally by 
maximizing the nonsymmetric entropy. 

We first give some hypothesis in a model of linguistics, in which the set 
of words is W = {wi,--- ,Wm}, a- text is a sequence sl of words, that is, 
sl = {si, ■ ■ ■ , Sl}, Si e W,i = 1, • • • ,L; Ni is the number of the word Wi in a 
text Sl, and pl(*) = ^ express the frequency of the word Wi in the text sl; 
as L — ^ oo, we assume p{i) = lim hence we have p(l) + • • • + p{m) ~ 1. 
Furthermore if we assume p{l) > p{2) > ■ ■ ■ > p{m), then Zipf's law can be 
written as a power law in the form: 

Pi^ = (1) 
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where a ~ 1 and is the probability of the most frequent word. 
Definition: we define a function 

- ,p(™)) = -^p(*)ln(ApW), (2) 

1=1 

where /3i > 0, i = 1, ■ • • , m, are nonsymmetric parameters. We call the function 
Sm the nonsymmetric entropy. 

Theorem: If {p(l), • • ■ ,p{m)} satisfies the following distribution 
then the nonsymmetric entropy take the maximum 

rn 

5„. = -lnp(l) = ln^-. (4) 



Proof: Instituting p{m) = 1 — p{l) — • • ■ — p(m — 1) into the Eq.ljSJ and 
setting its differential to zero yields 

that is, 

p(l) + --- + (l + -^)p(i) + ---+p(TO-l) = l, -,771-1. (6) 

Solving the equations system ®, we obtain 

1 (7) 

where p(l) = Denote Ak = {aij)kxk.aij = ^ + ^(^, since det Ak = 

-T^SfLi =i , where the hat means the corresponding item disap- 

pear, so from Utj = - df(ifop(j) = ^ + ^(k)' '^'^ ^"^^ ^^^^ matrix ^ = 
(aij)(m-i)x(m-i) is a positive defined matrix. Thus the distribution p{i) = 
— J- maximize the nonsymmetric entropy. The proof is completed. 

Corollary 1. If we take /3i — i", then we have 

Pi^)-^^-^^ i^l,---,-, (8) 



2 



in particular, we take a ~ 1, this is just the Zipf's law. If take /3i = {i + 7)", 
then we give Mandelbrot's law. 

Corollary 2. For Zipf's law, we have Sm+i > Sm, that is, the nonsymmet- 
ric entropy is increase as m increasing. 

Proof: It is easy to see from S„i = — Inp(l) = In^™ i. 

The above results suggest that the nonsymmetric entropy is a more funda- 
mental concept that will play an important role in the fields where Zipf's law 
emergences and references therein). The meaning of the nonsymmetric en- 
tropy needs a reasonable explanation. It is different with Shannon's entropy 
iS* = ~ Si^i I'^-PI*) some aspects. For example, if = 1 and others 
zeroes, then S = 0, but Sm — — ln/3j , this imphes that there exist some kinds of 
uncertain in some superficial reliable events under the nonsymmetric entropy. 
Other deep meanings of nonsymmetric entropy need more studies. Since the im- 
portant roles of Boltzman's entropy and Shannon's entropy in thermodynamics 
and information theory respectively, we tend to take maximal nonsymmetric 
entropy principle but Zipf's law as a basic principle. 
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